(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
9 August 2001 (09.08.2001) 




PCT 



i am iQiini u iiiiti urn mi i ii in Din mil urn mil mi limn mi nil mi 

(10) International Publication Number 

WO 01/57676 Al 



(51) International Patent Classification 7 : G06F 12/14 

(21) International Application Number: PCT/US0 1/02560 

(22) International Filing Date: 25 January 2001 (25.01.2001) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

09/495,813 
09/608,072 



1 February 2000 (0 1 .02.2000) US 
30 June 2000 (30.06.2000) US 



(71) Applicant: CIRRUS LOGIC, INC. [US/US]; P.O. Box 
17847, Austin, TX 78760 (US). 

(72) Inventors: NORTH, Gregory, Allen; 804 Weston Lane, 
Austin, TX 78733 (US). PERRY, Matthew, Richard; 
7913 High Hollow Drive, Austin, TX 78750 (US). 
KIRCHER, Brian, Christopher; 2204 Frontier Trail, 
Round Rock, TX 78681 (US). 



(74) Agent: MURPHY, James, J.; Winstead Sechrest & 
Minick P.C., Suite 5400, 1201 Main Street, Dallas, TX 
75250-0784 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, 
DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, 
ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, 
NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, 
TT, TZ, UA, UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TO). 

Published: 

— with international search report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



= (54) Title: METHODS FOR SYNTHESIZING TRANSLATION TABLES AND SYSTEMS USING THE SAME 



IT) 



100 



109. 
3.6864 UH^[ 
111 

32-768 KhT^ 



PLL 



32,768-KHz 
OSCILLATOR 



POR. RUN, 
RESET, WAKEUP 

BAT0K, EXTPWR 
PWRFL. BATCHC" 

EINTfl-3). FIQ_ 
MEDCHC 

FLASHING 
LEO DRIVE 
PORTS A, B. 
D 

PORT E 



114- 



110' 



STATE 
CONTROL 



4" 



RTC 



(5-BIT)- 
(3-BIT) 



129 



GPIO 



OC TO DC 

ADCCLK, ADCIN, 
AOCOUT. SUPCLK.- 

ADCCS 
SISCLK, SSITXFR. 
SSITXDA, SSIRXDA,- 
SS1RSFR 



130 



127 



128- 



SERIAL 
INTERFACE 



-108 



115^ 


POWER 




MANAGEMENT 



116-^ 


INTERRUPT 




CONTROLLER 



SSI (ADC) f— 



101 

A. 



ARM720T 



0ARM7TDMI 
CPU CORE 



B-K8YTE 
CACHE 



-102 



-103 



-C 



WRITE 
BUFFER 



L-104 
105 



TIMER 
COUNTER (2) 



ON-CHIP 
BOOT ROM 



-113 



-117 



136 



106- 



SECURITY 
REGISTERS AND 
HARDWARE 



INTERNAL DATA BUS 



106 



121- 



FLASH 
INTERFACE 



-A J-—D0-D31 



119- 



EXPANSION 
CONTROL 



120- 



DRAM 
CONTROLLER 

107 

) INTERNAL ADDRESS BUS 



123 



134 



A. 



LCD DMA 



1CE-JTAG 



0 



FLASH CARD. 
"FLASH MEM 

EXPCLK. WORO. 
►CS(0-3]. EXPR0Y. 
WRITE 

PARALLEL 
PORT, EPP 



A[0-27], 
133 



122- 



118- 
-107 



LCD 




CONTROLLER 






SRAM 




373 KBYTES 





„ TEST AND 
DEVELOPMENT 



-LCD DRIVE 



APB BUS 



APB BRIDGE | 
131 



124 126 ^ K 
UART1 1 * I 



UART2 



132 



3c 



125 



LED AND 
PH0T00I0DE 

ASYNC 

INTERFACE 1 
ASYNC 
* INTERFACE 2 



o 



(57) Abstract: A method of synthesizing translation tables includes the step of setting up at least one register for storing information 
controlling access to a plurality of memory spaces. A virtual address is generated which includes a pointer to selected information in 
the register controlling access to a selected one of the memory spaces. The selected information is accessed at the pointer from the 
register and a physical address to the selected one of the memory spaces is generated from the information accessed from the register. 



0157676A1J _> 



WO 01/57676 



PCT/USOl/02560 



METHODS FOR SYNTHESIZING TRANSLATION TABLES 
AND SYSTEMS USING THE SAME 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention relates in general to electronic appliances and in 
particular to circuits, systems and methods for information privatization in personal 
electronic appliances. 

DESCRIPTION OF THE RELATED ART 

Handheld personal electronic appliances have become increasingly popular 
as new technologies have allowed for the production of affordable devices with a 
high degree of functionality. One such device is the portable digital audio player, 
which downloads digital audio data, stores those data in a read-writeable memory, 
and converts those data into audio on user demand. The digital data is downloaded 
from a network or retrieved from a fixed medium, such as a compact disk, in one 
of several forms, including the MPEG Layer 3, ACC, and MS Audio protocols. An 
audio decoder, supported by appropriate firmware, retrieves the encoded data from 
memory, applies the corresponding decoding algorithm and coverts the decoded 
data into analog form for driving a headset or other portable speaker system. 

In order to prevent the unauthorized download of copyrighted material, 
such as music segments, some means of controlling the operation of a the personal 
appliance is desirable. This could be implemented for example through the 
issuance of passwords or software kernels authorizing download the relevant 
information. The passwords or software must be secured to prevent end user 
copying, distribution and tampering. Moreover, the audio decoder may be 
operating from proprietary firmware which must also be protected against copying 
or tampering. 



1 



0157676A1 J _> 



WO 01/57676 



PCT/US01/02560 



In sum, what are needed are methods, circuits and systems for securing 
information in personal digital appliances. To this end, the ability to secure this 
information should not depend on where in the appliance it is stored, whether it be 
in memory internal or external to the primary processing chip. Moreover, 
5 implementation of security should not waste resources, such as the available 

memory space, which could be used more directly for processing operations. 
Additionally, the security methods and hardware should be applicable to a wide 
range of different system configurations. 

1 0 SUMMARY OF THE INVENTION 

According to the principles of the present invention, 
a system is disclosed which includes a central processing unit operating in response 
to a set of instructions for processing information. An interface is included which 
provides access to selected circuitry forming a part of the system by an external 

1 5 device. A set of non-volatile programmable security elements selectively enable 

and disable the operation of the interface to provide a private environment for 
processing the information. The principles of the present invention provide, among 
other things the ability to privatize information in personal digital appliances. 
These principles can be implemented in a manner which does not waste processing 

2 0 resources, such as available memory space, which could be more directly used for 

processing operations. Moreover, these principles can be applied to a wide range 
of different system configurations that do not depend on where in the appliance the 
private information is to be stored, whether it be in memory internal or external to 
the primary processing chip. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present invention, and the advantages 
thereof, reference is now made to the following descriptions taken in conjunction 
with the accompanying drawings, in which: 

2 
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FIGURE 1 A is a high level functional block diagram of an integrated circuit 
embodying the principles of the present invention; 

FIGURE IB is a high level diagram of a second system embodying the 
inventive concepts; 

5 FIGURE 1C is a third exemplary system to which the present inventive 

principles can be advantageously applied; 
and ID are two additional; 

FIGURE 2 depicts integrated circuit 100 in a maximum utilization 

configuration, 

L0 FIGURE 3 is a high level functional block diagram of the processor 

depicted in FIGURE IB; 

FIGURES 4A depicts the external clock driving a pin EXPCLK when the 
clock enable signal on pin CLKEN is asserted with the system entering the Standby 
State; 

15 FIGURE 4B depicts the external clock driving a pin EXPCLK when the 

clock enable signal on pin CLKEN is asserted and the system is exiting the Standby 
State; 

FIGURE 5 is a state diagram illustrating the operation of the state control 

circuitry of FIGURE 1 A; 
2 0 FIGURE 6 is a block diagram of the three serial interfaces comprising the 

serial interface block of FIGURE 1 A; 

FIGURES 7A and 7B are timing diagrams illustrating the operation of the 
SSI (ADC) in conjunction with selected external devices; 

FIGURE 8 is a timing diagram illustrating the operation of the Codec 

2 5 interface of FIGURE 6; 

FIGURE 9 is a functional block diagram showing an interface between the 

I 2 S port of the serial interface block of FIGURE 6; 

FIGURE 10 is a timing diagram illustrating the operation of the I 2 S 
interface of FIGURE 9; 

3 
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FIGURE 1 1 is a functional block diagram illustrating the use of the SSI2 
port of FIGURE 6 in a master-slave configuration; 

FIGURE 12 is a flow chart describing system initialization at power-on 

reset; 

5 FIGURE 1 3 is a flow chart illustrating a procedure for locking private data 

in TLB; 

FIGURE 1 4 illustrates a cache lockdown procedure for locking secure code 
into cache; 

FIGURE 15 is a flow chart in which an emulated cache miss procedure is 
10 set forth; 

FIGURE 16A illustrates a preferred method of setting-up synthesized 
translation tables; 

FIGURE 16B is a flow diagram illustrating a table walk through the 
synthesized tables of FIGURE 16A; and 
1 5 FIGURES 1 7 A through 1 7E illustrate a preferred procedure for performing 

an emulated table walk. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The principles of the present invention and their advantages are best 
understood by referring to the illustrated embodiment depicted in FIGURES 1 A 
through 17E 

of the drawings, in which like numbers designate like parts. 

FIGURE 1 A is a high level functional block diagram of an integrated circuit 
1 00 embodying the principles of the present invention. Integrated circuit 1 00 could 
be, for example a Cirrus Logic EPxx integrated circuit. Integrated circuit 100 can 
advantageously be utilized in a number of consumer and industrial handheld 
information appliances, including personal digital assistants, electronics organizers, 
and two-way pagers, among other things. In particular, integrated circuit 100 can 

4 
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be configured to perform audio processing in battery powered internet audio 
decoders. 

Two additional exemplary systems to which the present inventive principles 
can be advantageously applied are shown in FIGURES IB and 1C and will be 
discussed further below. 

FIGURE 2 depicts integrated circuit in a system configuration, and will be 
referenced during the discussion of the input/output signals (ports) of the various 
functional blocks of integrated circuit 100. 

Integrated circuit 100 is built around an ARM720T processor 101 as 
described in the ARM720T data sheet available from ARM, Ltd., Cambridge, 
United Kingdom. Generally, processor 101 includes a central processing unit 
(CPU) core 1 02, 8-kilobyte cache 1 03, memory management unit (MMU) 1 04 and 
write buffer 105, each of which will be described in further detail below. It should 
be noted that in alternate embodiments, an ARM920 processor may also be used. 
15 CPU 102 is a 32-bit microprocessor based on a reduced instruction set 

computer (RISC) architecture. The associate 8-kilobyte cache 103 is a mixed 
instruction and data cache (IDC) and is organized as a four way set-associative 
cache of 512 lines of 16 bytes (4 words). 

MMU 104 includes a translation look aside buffer (TLB), access control 
2 0 logic and translation-table-walking logic. The primary functions of MMU 104 are 

the translation of virtual addresses into physical addresses and the control of 
accesses to memory. It also supports a conventional two-level page-table structure. 
Generally, the TLB encaches 64 translated entries and provides the translation to 
the associated access control logic. If a virtual address causes a hit to a translated 
25 entry in the TLB, the access control logic determines whether the access is 

permitted. In the case of a permitted access, MMU 104 outputs the corresponding 
physical address from the TLB cache. Otherwise, if the access is not permitted, 
MMU 1 04 signals CPU 1 02 to execute an abort. If a virtual address causes a miss 
to the TLB cache, the translation-table-walking circuitry retrieves the necessary 
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translation information from a translation table in physical memory. This 
translation information is written into the TLB cache at a replacement point or 
entry. The access control logic can then determine whether or not the access is 
allowed. 

Write buffer 105 is used to buffer up to eight words of data and four 
independent addresses. When enabled, CPU 102 writes data or an instruction into 
write buffer 105 using an external clock and then returns to instruction execution. 
Write buffer 105 can then, in parallel, write data onto internal data bus 106 and 
addresses onto internal address bus 107. 

An on-chip phase locked loop (PLL) 1 08, driven by a 3 .6864 MHz crystal 
1 09 is used in one mode to provide the clocks to processor 1 01 . In embodiments 
where the ARM720T is used, the primary (CPU) clock can be programmed to 
either 18.432 MHz, 36.864 MHz, 49.152 MHz or 73.728 MHz. (PLL 108 
preferably runs at twice the highest possible CPU clock frequency or 147.456 
MHz). When the CPU clock frequency is selected to be 36.864 MHz, internal data 
bus 106 and internal address bus 107 are also clocked at approximately 36 MHz. 
For CPU clock frequencies above 36 MHz, only processor 101 runs at the higher 
clock rate, with internal data bus 106 and internal address bus 107 being clocked 
at the 36 MHz rate. The CPU clock frequency is selected by programming a 
two-bit register field in the system control register SYSCON3. (A list of registers 
internal to integrated circuit 100 is provided as Table 1 ; a complete description of 
those registers is found in the Cirrus Logic EP7211 Preliminary Data Sheet, 
incorporated herein by reference. 

It should be noted that integrated circuit 100 also includes and external 
clock input which allows for the input of an external 13 MHz clock for driving 
substantially all of the on-chip circuitry in a second clocking mode. The external 
clock drives a pin EXPCLK when the clock enable signal on pin CLKEN is 
asserted as shown in FIGURES 4A and 4B, where FIGURE 4A shows integrated 

6 
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circuit 100 entering the Standby State and FIGURE 4B exiting the Standby State. 
(The Standby State is discussed further below). 

Oscillator 1 10 is used to generate a 1 hertz clock which is used to drive a 
32-bit real time clock generator (RTC) 1 12. RTC 1 12 can be written to or read 
from and includes a 32-bit output match register which allows the issuance of an 
interrupt when the time in the RTC matches a predetermined specific time. RTC 
1 12 is also used to drive a programmable LED flasher (not shown). 

Additionally, integrated circuit 100 includes a pair of on-chip timer 
counters 113. Each timer counter is independent and includes a 16-bit 
readable-writeabledata register. The given counter is loaded to a desired value and 
then decrements in response to a preselected clock. When the timer counter 
underflows (i.e., reaches zero) the appropriate interrupt is generated. The timer 
counter registers can be read at any time. The clock frequency of these timers can 
be selected by writing to corresponding bits in the system control registers 
SYSCON. For example, when PLL 108 is sourcing the internal clocks, 512 kHz 
and 2 kHz rates are available to timer counters 113. When a 13 MHz clock from 
an external source is used, 541 kHz and 2.115 kHz clocks are available. 
Additionally by using a Divide by 26 circuitry, enabled by setting a bit in system 
control register SYSCON2, a 500 kHz clock can also be generated from the 13 

2 0 MHz source. 

Each timer counter 113 can operate in either a free running mode or a 
prescale mode by setting or clearing bits in system control register SYSCON1 . In 
the free running mode, the given counter wraps around to OxFFFF when it 
underflows (i.e. reaches zero) and continues to count down. In the prescale mode, 
the value written into the given timer counter is automatically reloaded when the 
counter underflows. The prescale mode can be used to produce a programmable 
frequency, drive a buzzer, or generate a periodic interrupt. 

State control circuitry 1 14 allows integrated circuit 100 to be set to either 
an Operating, Idle, or Standby state. A state diagram illustrating the operation of 
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state control circuitry 114 is shown in FIGURE 5. The Operating state is the 
normal program execution state and all clocks and peripheral logic are enabled. 
The Idle state is similar to the Operating state with the exception that the CPU 
clock is halted pending an interrupt or wake-up to return it back to the Operating 
5 state. In the Standby state, PLL 108 is shut down, although Crystal 111 and 

oscillator 1 10 and RTC circuitry 112 remains active. The external address and data 
buses are also forced low in the Standby State to prevent any powered down 
peripherals from draining current. It should be noted that integrated circuit 100 
when first powered, or during a cold reset, is forced into the Standby state, which 

1 o can only be left by external wake-up prompt. 

In addition to state control circuitry 114, power management is also 
effectuated through power management control block 115. The state of various 
functional blocks of integrated circuit 1 00 in each state are tabulated in TABLE 1 2. 
Power management circuitry 115 forces integrated circuit 100 into the Standby 
1 5 mode when an active low power failure signal PWRFL is received from an external 

power supply unit 201 . If integrated circuit 100 is being driven by an external DC 
power source 202, the external power sense input signal EXTPWR is driven active 
low. If a battery 203 is being used, an active high on the B ATOK pin indicates that 
the main battery is OK. The falling edge of this signal generates an FIQ (fast 

2 0 interrupt request) while a low level signal on this pin in the Standby state inhibits 

system start up. The new battery sense signal BATCHG indicates that a new 
battery is required; an active low at this input occurs if the battery voltage falls 
below a "no battery" threshold. The battery powering integrated circuit 100 could 
be, for example, one or more standard AA batteries widely available to retail 
25 consumers. 

When unexpected events arise during execution of a program (i.e., an 
interrupt or memory faults) an exception is usually generated. When multiple 
exceptions occur, interrupt controller 116, operating on a fixed priority system, 
determines the order in which the exceptions are serviced. 

8 



BNSDOCIO. <WO 0157676A1_I_> 



PCT/USOl/02560 

WO 01/57676 

Integrated circuit 1 00 operates on two interrupt types, namely the interrupt 
request (IRQ) and the fast interrupt request (FIQ). HQs have a higher priority than 
IRQs. When two or more interrupts of the same type occur simultaneously, the 
contention is resolved in software. TABLES 2A-2C set out preferred interrupt 
5 allocation, whereinlNTMRl andlNTSRl are respectively the First Interrupt Mask 

RegisterandFirstmtermptSta^ 

the Second Interrupt Mask Register and Second Interrupt Status Register, and 
INTMR3 and INTSR3 are the Third Interrupt Mask Register and Third Interrupt 
Status Register. It should be noted that if two interrupts are received from within 
10 the same group (IRQ or FIQ), the order in which they are serviced is preferably 

resolved in software. 

Generally, interrupt controller 116 operates as follows. An external or 
internal interrupting device asserts the appropriate interrupt. If the appropriate bit 
is set in the corresponding Interrupt Mask Register, then either an FIQ or IRQ is 
15 asserted by Interrupt Controller 116. If the interrupts are enabled, processor 101 

jumps to the appropriate address. Interrupt dispatch software then reads the 
corresponding Interrupt status register to establish the source of the Interrupt and 
calls the appropriate Interrupt service routine software which then clears the 
Interrupt source through some action specific to the interrupting device. The 
2 o Interrupt service routine may then re-enable interrupts, and any other pending 

interrupts are similarly serviced. All other external Interrupt sources are held 
active until the corresponding service routine starts executing. 

TABLE 3 sets out the external interrupt latencies. In the Operating state, 
processor 101 checks for a low level on its FIQ and IRQ inputs after each 
2 5 instruction is executed. Hence, there is an Interrupt latency directly related to the 

amount of time it takes to complete the current instruction after an Interrupt 
condition is first detected. In the Standby state, the latency will depend on whether 
the system clock is shut down and if a control bit FASTWAKE in the system 
control registers is set. As indicated above, PLL 108 is always shut down in the 
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Standby state. If the FASTWAKE bit is cleared, the latency is between 0.125 
seconds to 0.25 seconds. If this bit is set however, then the latency will be between 
250 microseconds and 500 microseconds. If an external clock is used and disabled 
during Standby, the latency may also be between 0. 1 25 seconds and 0.25 seconds 
5 to allow for oscillator stabilization. If the external clock is not disabled, the latency 

can be reduced to a few microseconds. An Interrupt can also cause integrated 
circuit 100 to leave the Idle state. In this case the CPU clock must be restarted and 
additionally, interrupt servicing may be delayed for instruction execution as 
described above. 

10 In the illustrated embodiment, an on-chip boot ROM 1 1 7 is provided which 
maintains a set of instructions for initializing integrated circuit 100. On-chip boot 
ROM also configures UART1, discussed further below, to received 2048 bytes of 
serial data which are downloaded into on-chip SRAM 118. Once the data has 
been downloaded into SRAM 1 18, processor can continue executing instructions 

15 by jumping to the start of the SRAM. Advantageously, this configuration allows 

code to be downloaded to program a system flash memory during the manufacture 
of a device employing integrated circuit 100. It should be noted that the user may 
select between booting from on-chip ROM 117 or from an external memory 
connected to port CS[0]. Specifically, if the signal at pin MEDCHG is low, boot 

2 0 is from on-chip ROM 1 1 7 while a high signal applied to this pin requires that boot 

be performed from the external memory. It should also be noted that the effect of 
booting from the on-chip boot ROM is a reversal of the decoding of all chip select 
signals internally. This feature is illustrated in TABLE 5A with the normal, 
unreversed chip select decoding illustrated in TABLE 5B, Additionally, boot can 

25 be accomplished from external memory, with the width of the boot device having 

a selectable width in accordance with TABLE 4. 

The ARM720T processor has a 4 Gbyte address space. In the illustrated 
embodiment, integrated circuit 100 uses the lower 2 Gbytes of the address space 

10 
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for ROM/RAM/Flash and expansion space. Another 0.5 Gbyte is used for DRAM 
and the remaining 1.5 Gbytes, less 8K for internal registers, is unused. 

A memory and I/O expansion interface supports six separate linear memory 
or expansion segments to external expansion memory 204. Two additional 
5 segments are dedicated to the on-chip SRAM and ROM. Each segment is 256 

megabytes in size. Any of the six segments can be used to support a conventional 
SRAM interface. Moreover, each segment can be individually programed to be 8-, 
16- or 32-bit wide, to support page mode accesses, and to execute from one to eight 
wait states for nonsequential addresses, and zero to three for burst mode accesses. 
10 The zero wait state sequential feature allows integrated circuit 1 00 to interface with 

burst mode ROMs. It should be noted that the on-chip ROM space is fully decoded 
while the complete SRAM address space is folly decoded only up to the maximum 
size of the video frame buffer used to drive an external LCD (up to 128 kBytes). 
Twooffoeexpansionsegmentscanberesei^edforestablisWnganmterface 

15 with two PC Card cards 205 using the chip select signals NCS4 and NCS5. 

Interface with the external PC cards is preferably made through Cirrus Logic 
CL-PS6700 PC card slot drivers 206. The memory is segmented to allow different 
types accesses to take place (i.e., attribute, I/O, and common memory space). 

The EXPCLK port to expansion control block 1 19 outputs an expansion 

2 o clock which is equal to the CPU clock in the 13 MHz and 18 MHz modes, and has 

a rate of 36.864 MHz when integrated circuit 100 is operating in the 36, 49, or 70 
MHz modes. (EXPCLK port is used as the clock input in the 13 MHz mode 
discussed above). The EXPRDY pin (Expansion Port Ready) is driven low by the 
external expansion devices to extend the bus cycle and insert wait states. The chip 

2 5 select signals CS[0:3] are used for SRAM expansions while chip select signal 

CS[4:5] can be used for either memory expansion or PC card selection. The write 
strobe WRITE is low during reads from and high during writes to, the expansion 
devices. The wor(l/halfword bits (2) indicate to the external devices during writes 
from integrated circuit 1 00 whether the access size is in words, halfwords or bytes. 



11 
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DRAM controller 120 provides a programmable 16-bit or 32-bit wide 
interface to up to two banks 207 of DRAM, with each bank having a storage 
capacity of up 256 Mbytes. The DRAM banks can be any of a number of types of 
DRAMs available in the marketplace, including conventional DRAM, synchronous 
5 DRAM (SDRAM), extended data out DRAM (EDODRAM), fast page mode 

DRAM, and double data rate DRAM (DDRDRAM). Moreover, these DRAMs can 
be of the self-refresh type which are placed in a low power state when integrated 
circuit 100 enters the Standby state discussed above. To support two banks, two 
row address strobes RAS[0:1] can be generated along with four column address 

1 0 strobes CAS[0:3]. The output enable signal MOE is used for either the DRAM, 

ROM/SRAM/Flash or expansion output enable. The write enable signal NWE is 
used for the same set of external devices. Additionally, DRAM controller includes 
a programmable refresh counter, with the refresh period by controlled using the 
refresh period register (DRFPR). 

15 The preferred physical to DRAM addressing is shown in TABLE 6. 

TABLES 7 and 8 illustrate DRAM address mappings for 32- and 16-bit DRAM 
memory systems. The 32-bit is assumed to be based on two x 1 6 devices connected 
to each RAS line with 32-bit DRAM operations selected. The mapping is repeated 
for every 256 Mbytes in each bank. The placeholder "n" is these tables is equal to 

2 0 OxC + bank number. The 1 6/32-bit DRAM selection is programmed by setting a 

bit in system control register SYSCON2. 

Flash interface 121 allows integrated circuit 100 to interface with flash 
memory, using the chip select signal CS[0:1] described above. 

LCD controller 122 provides all the necessary control signals to allow 

25 integrated circuit 100 to interface directly to a single panel multiplexed LCD 

module 209. The total frame buffer size is programmable up to 128 KBytes, using 
both on and off chip memory. A system can be built using no external DRAM, 
with on-chip SRAM 1 1 8 used as the LCD video frame buffer, as described above. 
The screen is preferably mapped to the video frame buffer. 

12 
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LCD direct memory access (DMA) engine 123 is provided for fetching 
display data for LCD controller 122 from frame buffer memory. The pixel bit rate, 
hence the LCD refresh rate, can be programmed from 18.432 MHz to 576 kHz 
when operating in the 18.432-73.728 MHz modes, or 13 MHz to 203 kHz when 
operating from a 13 MHz clock. 

Integrated circuit 100 includes a pair of universal asynchronous 
receive-transmit (UART) interfaces 124 and 125. These asynchronous ports can 
be used, for example, to communicate with a pair of RS-232 transceivers 210. 
Each UART 124/125 can support data rates of up to 1 15.2 Kbits per second, when 
integrated circuit 100 is operating from clocks generated by PLL 108. When 
integrated circuit 100 is being driven from a 13 MHz external clock source, the 
UART bit rates that can be generated include 9.6 Kbps, 19.2 Kbps, 38 Kbps, 58 

Kbps and 115.2 Kbps. 

Both UARTs 124/125 include a 16-byte transmit FIFO driving a 
corresponding transmit (TX) pin and a 16-byte receive FIFO for receiving data 
from a dedicated receive (RX) pin. An RX interrupt is asserted when a given RX 
FIFO becomes one-half full or if that FIFO is non-empty for longer than three 
character length times with no more characters being received. A TX interrupt is 
asserted whenever the given TX FIFO buffer reaches one-half empty. 

In addition to RX and TX ports, UART 124 (UART1) can also receive the 
three modem control signals CTS, DSR, and DCD. An additional modem control 
RI input and output modem control signals RTS and DTR can be implemented 
using the GPIO ports 129 discussed further below. A Modem Status Interrupt for 
UART1 is generated if any of these modem control bits change. 

UART operation and line speeds are programmable through the UART bit 
rate and line control registers (UBLC1 and UBLC2). Additionally, four of the 
FIFOs can also be programmed to have a 1 -byte depth. Framming and parity error 
bits, which are detected at each byte is received, are also readable from 1 1 -bit wide 
registers. 
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Integrated circuit 1 00 also includes an IrDA (infrared data association) SIR 
protocol post processing stage 126 at the output of UART1 124. It includes a pin 
for driving an infrared light emitting diode (LED) and an input for connection to 
a photodiode (shown together as block 21 1 in FIGURE 2). SRI encoder 126 is 
5 switched into the TX and RX ports of UART 1 , such that these signals can drive the 

infrared interface directly. 

Integrated circuit 100 additionally includes an SPI/Microwire master mode 
128 Kbps ADC interface 127 and serial interface 128, which is shown in further 
detail in FIGURE 6. A preferred serial pin assignment for the Digital Audio Port 

10 is found in TABLE 10. SPI interface 1 (ADC interface) can be used to 

communicate with an external analog to digital converter 212 and digitizer 213. 
Serial interface block 128 includes a master slave mode SBI/Microwire (SSI2) 
interface 603, digital audio interface (DAI) 601, and codec interface 604, all of 
which "are multiplexed through Tnultipiexer~602~ onto a single~set of external 

1 5 interface pins. The selected interface drives the corresponding circuitry in block 

214 of FIGURE 2. Multiplexing is controlled by programming corresponding 
fields in the system control registers. A summary of the available serial interface 
options is provided in Table 11. 

ADC interface 127 is compatible in a default mode with SSI or Microwire 

2 0 compatible devices such as the MAXIM, MAX148/9 peripherals. ADC interface 

127 also can interface with devices such as Analog Devices AD7188/12 chips 
using NADCCS as a common RFS/TFS line. Exemplary timing diagram when 
integrated circuit 100 is driving a MAX 148/9 and a AD7811/2 are provided as 
FIGURES 7A and 7B respectively. An exemplary I 2 S interface is shown in 

25 FIGURE 8. 

The clock output frequencies for ADC interface 127 can also be set using 
the system control registers. In the 1 8.432-73.728 MHz operation mode, the ADC 
clock (ADCCLK) can be set to either 4, 16, 64, or 128 KHz. When integrated 
circuit 100 is operating in response to a 13.0 MHz externally generated clock, the 
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ADC clock can be set to 4.2, 16.9, 67.7, or 135.4 kHz. The sample clock 
SMPCLK always runs at twice the frequency of the shift clock (ADCCLK). The 
available ADC frequency options are set forth in TABLE 12. 

The ADC serial output ADCOUT is fed by either an 8-bit or 16-bit shift 
5 register in response to a bit set in the SYNCIO register. The ADC serial input 

channel ADCIN is captured by a 16-bit shift register. The ACD clock 
synchronization pulses are activated by a write to the output shift register. During 
transfers an SSIBUSY (synchronous serial interface) busy bit in the system status 
flag registers the set. When the transfer is complete and valid data is in the 16-bit 

1 o read shift register, an SSEOTI interrupt is asserted and the SSIBUSY bit is cleared. 

The sample clock SMPCLK is independently enabled. 

Digital Audio Interface 501 provides an interface to CD quality A/D and 
D/A converters, such as that shown in FIGURE 9. (DAIs are a subset of I 2 S). 
1 28-bit frames of 1 6-bit stereo digital audio, at the audio sampling frequency, with 
15 separate transmit and receive lines. It should be noted that each frame only 

contains 16 bits of right channel and 16 bits of left channel audio data. The 
remaining bits are set to zero. 

FIGURE 10 is an exemplary timing diagram illustrating the operation of 

2 0 DAI 601 . The left-right clock (LRCK) provides the frame synchronization signal. 

The serial clock (SCLK) is the bit transfer clock and preferably has a rate fixed at 
128 times the audio sample frequency. The SDOUT (SDATAO) and SDIN 
(SDATAI) are respectively used for sending playback data an external D/A 
converter and for receiving record data from an external A/D converter. Timing 
2 5 between integrated circuit 1 00, an external D/A converter and/or an external A/D 

converter is based of the oversampled clock MCLK. Preferably, the MCLK has a 
rate fixed at 256 times the sampling frequency. 

Asynchronous serial interface 2 (SSI2) 503 is an SPI/microwire interface 
that can operate in a full master-slave mode. FIGURE 10 illustrates a pair of 
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integrated circuit 1 00 devices configured to operate in a Master-Slave fashion. The 
preferred sustained data rate is 85.3 Kbps, which ensures a sufficiently long period 
between interrupts. An interrupt is generated when the receive FIFO is half-full 
and the transmit FIFO is half-empty. In the slave mode, the serial clock (SSICLK) 
5 and the serial receive port (SSIRXDA), the received synch control pin (SSIRXFR) 

and the transmit synchronization pin (SSITXFR) are inputs and the transmit pin 
SSITXDA is an output. In the master mode, pins SSICLK, SSITXDA, SSITXFR 
and SSIRXFR are outputs and pin SSIRXDA is an input. Mode selection is 
through the programming of bits in the system control registers. 

1 o Asymmetric (unbalanced) and continuous traffic are both supported through 

the use of the separate transmit and frame synch control lines SSITXFR and 
SSIRXFR. In this configuration, the receiving node receives a byte of data on the 
eight clocks following the assertion of the received frame synch control signal and 
the sending node transmits a byte on the eight clocks following the assertion of the 
15 independent transmit frame synch control pulse. Exemplary timing diagrams 

illustrating the operation of these two interfaces are provided in FIGURES 7 A and 
7B for reference. 

Codec Interface 604 supports a direct connection to a telephony codec. 
Along with clock and control signal generation, codec interface 604 also performs 

2 0 parallel to serial and serial to parallel conversions. The interface is full duplex and 

employs corresponding transmit and receive FIFO operating at 64 Kbs. When 
enabled, the codec interrupt CSINT is generated every 8 bytes transferred (i.e., 
FIFO half full/empty) or, in other words, every 1 msec with a latency of 1 msec. 
This timing is illustrated in FIGURE 8, where CDENRX and CDENTX are 
25 respectively the receive and transmit control bits in system control register 

SYSCON1. 

DAI 60 1 supports an I 2 S interface, such as interface 900 shown in FIGURE 
9. In this case, both an external ADC 901 and an external DAC 902. A clock 
source 903 provides the time base. An exemplary timing diagram is provided in 
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FIGURE 10. In FIGURES 9 and 10, the MCLK is the oversampled clock which 
is typically fixed at 256 times the audio sampling frequency. The SCLK is the bit 
clock which is typically fixed at 128 times the audio sampling frequency. The 
LCLK is the frame sync signal and is typically fixed at the audio sampling 
5 frequency. SDOUT is the audio data output sending playback digital audio to DAC 

902. SDIN receives record data from ADC 901 . 

SSI1 interface 603 supports master-slave operation as shown in FIGURE 
11. This interface provides a means for effectuating full duplex serial transfers 
between two nodes. Data are transferred in bytes in response to a clock and a 
l o frame synchronization signal. 

Integrated circuit 100 is also provided with a set of general purpose input 
output (GPIO) ports 129. In the illustrated embodiment, there are three 8-bit ports 
and one 3-bit port. The GPIO ports can be used for such purposes as establishing 
an interface with a keyboard driver 215. 

Pulsed with modulator (PWM) circuitry 130 includes two outputs for 
driving DC to DC 216 converters operating in conjunction with external power 
supply unit (PSU) subsystem 201 . The external input pins normally connected to 
the output from comparators monitoring the external DC to DC converter output 
are used to enable these clocks. When integrated circuit 100 is operating from 
internal PLL 108, the PWM clocks have a frequency of 96 kHz. The duty cycle 
ratio for these signals can be programmed from 1 and 16 to 15 and 16. The sense 
of the PWM drive signal active cycle can be set high or low by latching the state 
of the drive signal during power on reset (i.e., a pull up on the drive signal will 
results in an active low drive output, and vice versa). As a result, either positive 
or negative voltages can be generated by the external DC to DC converters. These 
outputs can similarly be disabled by clearing bits in a control register. 

Communication between the blocks of integrated circuit 100 is established 
through an Advanced Peripheral Bus 132 and an Advanced Peripheral Bus Bridge 
131. Internal data bus 106 is 32-bits wide and can be connected to the external 
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devices through multiplexing circuitry 132. Internal address bus 107 is 28-bits 
wide and can communicate with external devices through multiplexing circuitry 
133. ICE-JTAG circuitry 134, which is IEEE 1149.1 compliant, is included for 
boundary scanning during test and development. Additionally, the Embedded ICE 
5 supports the debugging of the ARM processor core. 

In the preferred embodiment, the internal registers of integrated circuit 1 00 
are in the little endian configuration. However, integrated circuit 100 can 
advantageously interface with a big endian external memory system. Specifically, 
the big end bit and the CPU 101 register sets determines whether words in the 

10 external memory are being stored in a big endian or little endian format. 

Specifically, memory is viewed as a linear collection of bytes numbered upwards 
from zero. Bytes 0-3 hold the first stored word, bytes 4-7 the second stored word, 
and so on. In the little endian scheme, the lowest number byte in a word is 
considered to be the least significant byte of the word and the highest number byte 

15 is the most significant word. Thus, byte zero in a little endian system are 

connected to data lines 7-0. In the big endian scheme, the most significant byte of 
a word is stored at the lowest numbered byte, and the least significant byte is stored 
at the highest number byte. Therefore, byte zero in a big endian system is 
connected to data lines 31-24. In the illustrated embodiment, only the load and 

2 0 store instructions are effected by endian-ness. TABLES 13 and 14 illustrate the 

operation of integrated circuit 1 00 for both reads (TABLE 1 3) and writes (TABLE 
14). It should be noted that the column address strobe lines NCAS[3:0] to the 
DRAM banks are always connected to the same byte lane irrespective of the 
endianness. For example, column address strobe line NCAS[0] will be associated 

2 5 with data line D[7:0] and NCAS[3] associated with data lines D[31:24]. As a 

result, in a little endian system, line NCAS[0] will be asserted for a read/write to 
DRAM byte 0 and in a big endian system, line NCAS[3] will be asserted to access 
DRAM byte 0. 
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Integrated circuit 100 includes a set of programmable fuses which allow 
each chip to be assigned one or more unique ID numbers and passwords. The 
programmable fuses and related registers are disposed within Security Registers 
and Hardware block 133 operating off APB 132 (FIGURE 1). With specific 
5 regards to the embodiment ofFIGURE ID, the boot ROM itself will reside on the 

ARM local bus 107 and the access checking will be split and have logic on both 
the ARM local bus and within the ARM local-global AHB wrapper. 

In the preferred embodiment, the are 256 programmable fuses, including 
sets of public and private fuses. The addresses and values of the private fuses are 
10 hidden such that only private firmware corresponding to those fuses is allowed 

accesses. In a non-private environment, these addresses and values return allzeros. 
The public fuses are described in TABLE 15 and the private fuses in TABLE 16 

Integrated circuit 100 also includes embedded hardware within block 133 
15 tocheckmemsedhammingcodewiththehammingcodethatmatchestheselected 

ID. When the validation address is read, the ID value is matched with it hamming 
value and verified. The resulting 5 bit code provides debug information (all fuses 
blown or all fuses not blown) if the hamming codes do not match. Table 17 gives 
thedecodingforthe validation readbits. This advantageously allows detects errors 

20 which have occurred when the fuses were blown, while preserving the 
inaccessibility of the fuse values and addresses. 

Table 18 provides the addresses that return the validation codes for the 

public ID-CHK pairs. 

Table 19 provides the addresses that return the validation codes for the 
2 5 private ID-CHK pairs. These addresses are only accessible by the firmware when 
integrated circuit 100 is operating in a private mode and will read 0's otherwise. 

In order to allow the hamming code generator to be adequately tested there 
are two test registers that can be selected as the ID-CHK pair and validated. Their 
definitions and locations are provided in Table 20. 
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Figure IB is a high level functional block diagram of a second system on 
a chip 140 suitable for practicing the principles of the present invention. This 
embodiment employs an ARM920T processor 141 having both instruction and 
data caches, as well as an MMU. System 141 does not include general purpose 
5 SRAM in contrast to integrated circuit 100. 

FIGURE 13 is a more detailed functional block diagram of processor 141, 
in particular for those embodiments based on an ARM 920T core. In this 
embodiment, the available cache comprises both an instruction cache 1301 and a 
data cache 1302. Similarly, separate instruction and data MMUs 1303 and 1304 
0 are used. The instruction modified virtual address (1MVA), instruction physical 

address (EPA) and instruction data (ID) buses are each 32 bits wide. Similarly, the 
data modified virtual address (DVMA), data physical address (DPA) and data data 
(DD) buses are 32 bits wide. Physical addresses and data are exchanged to AHB 
bus 142 through AMBA bus interface 1305. A write buffer 1306 allows for the 
5 parallel exchange of data through interface 1 305 during processor core operations. 

Data from data cache 1302 can be output through write-back physical address 
(PTAG) RAM 1307. 

Integral to the processor core is a coprocessor which includes a register for 
translating virtual addresses issued by the CPU into the modified instruction and 
: 0 data virtual addresses (MV A) transmitted on IMVA and DMV A shown in FIGURE 

IB. Specifically, for addresses to the memory region from 0 to 32 MBytes, the 
virtual address VA is modified by a 7-bit process identifier as VMA = VA + 
(ProcID x 32 MBytes), where the process identifier ProcID is either a read or write 
process identifier. 

. 5 System 141 is based on an internal AHB (Advanced Microcontroller Bus 

Architecture High-Speed Bus) 142, as well as an internal an APB (Advanced 
Peripheral Bus) 143. An AHB/APB bridge 144 interfaces AHB 142andAPB 143. 
A second bridge 145 interfaces processor 141 with AHB 142. 
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Among the devices operating off AHB 142 are Graphics Engine 146 and 
Raster Engine 147. Generally, Graphics Engine off-loads such functions as block 
transfers and line draws from processor 141 to improve system graphics 
performance. Preferably, Graphics Engine 146 uses a standard Device Independent 
Bitmap (DIP) format for supporting Windows CE. Raster Engine 1 47 is provided 
to raster data from an external display buffer, through synchronous DRAM 
interface 148, to drive an external LCD, CRT or TV display unit. 

Additional on-chip interfaces to internal AHB include an interface 149 for 
coupling system 141 to an external system bus, a PCMIA for interfacing with an 
external PC card, and Test Interface Controller (TIC) interface 151 for testing such 
on-chip circuit blocks as the DMA controller and the raster system. Memory 
interface 152 provides for the exchange of control signals and data with external 
SRAM, Flash or ROM in a manner similar to that discussed above. Finally, boot 
of the system, which will be discussed further below, is effectuated, at least in part, 
using the Boot ROM 153. In this example, boot ROM 153 is operating off AHB 
142, although it could run off any one of a number of global and local buses in 

alternate embodiments. 

System 140 includes an 8-channel DMA engine 154, which prioritizes and 
services request by on-chip resources, such as the U ARTs, for accesses to external 
memory. The Joint Test Action Group (JTAG) port 155 supports debugging of the 
on-chip processor and related circuitry Additionally, aUniversal Serial Bus (USB) 
controller 156 and Ethernet port 157 operate directly from the AHB. 

A number of peripheral devices are provided on-chip and operate off of 
APB 143. Among other things, system 140 includes three UARTs 158, 159 and 
160. Additionally, apair of SP1 interfaces 161 and 162 and an AC97 interface 163 
are included in the illustrated embodiment. A real time clock (RTC) 1 65, general 
timer set 166 and watchdog timer 167 are also provided in this embodiment. An 
additional memory interface, EEPROM interface 1 68, also couples to the APB. 
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Manual input of data can be made through an external key matrix coupled 
to Key Matrix Interface 169, or a Touchscreen interfacing with Touchscreen ADC 
171 and Touchscreen Interface 170. LED outputs 172 are also included in the 
system 140 user interface. 

Similar to integrated circuit 100, system 140 includes a set of general 
purpose input/output (GPIO) ports 173, an interrupt controller 174, and on-chip 
PLLs 1 75 driving system control circuitry 1 76. Control circuitry includes memory 
remap and system pause control circuitry 177. Flash VPP Control block 178 
generates the voltages required for writing and erasing external flash. 

FIGURE 1 C is a high level functional block diagram of another exemplary 
system-on-a-chip 180 to which the principles of the present invention can be 
suitably applied. In this instance, the CPU core 181, which could be for example 
a ARM7TDMI controller, does not utilize an MMU or on-chip cache. CPU 181 
operates in conjunction with the AHB bus 142 via a local AHB bus and 
Local/Main AHB interface 182. CPU 181 is supported by memory 183, security 
gates 184 and security/reset circuitry 185. Security will be discussed in further 
detail below. 

In this embodiment, system 180 additionally includes a digital signal 
processor (DSP) 186 supported by global memory 189, data memory 190 and 
program memory 191. Interprocessor communications registers 192, I 2 S Audio 
Input/Output Port 193, PWM circuitry 194 that is capable of driving external 
speakers at CD quality levels without using analog DAC support circuits, and DSP 
Timer/STC 195 communicate with DSP 1 86 via DSP Peripheral Bus 196. These 
devices also operate off the APB. Among the peripheral devices also operating 
from the APB include a USB Slave Port 197, SPI for Serial Media Input 198 and 
I 2 S Host Port 199. 

The Motion Picture Expert Group (MPEG) audio compression standard 
defines the syntax for a coded stream of digitized audio data, along with a process 
for decoding that stream. In the audio arena, three layers, Layers I - HI 
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respectively, are defined. For purposes of the present discussion, Layer m, which 
provides the highest quality audio reproduction, will be considered. 

The encoding process begins with the sampling of one or more audio 
channelsatagivensamplingrate,whichmaybe32,44.1 or 48 kHz. Theresulting 
digitized stream is passed through a polyphase filter bank which divides the 
receivedtime-domainstreaminto32frequencysubbands. Typically the filter bank 
operates of 64 input samples at a time with 50% overlap such that 32 output 
frequency-domain samples are produced for 32 input time-domain samples. 

A psychoacoustic model is used to remove those parts of the audio signal 
which cannot be heard by the human ear due to auditory masking. Auditory 
masking is the characteristic of the human auditory system wherein a strong audio 
signal renders a temporally or spatially close weaker audio signal imperceptible. 
Moreover, the ability of the human ear to distinguish sounds is frequency 
dependent. Within certain critical bands, the ear does not precisely delineate 
between various in-band audio components. The processing subbands, which 
approximate these critical hearing bands, are quantized as a function of the 
audibility of the quantization noise within that subband. 

The psychoacoustic model engine, operating in parallel with the polyphase 
filter, determines the available noise masking for a given frequency component and 
a given loudness. From this information, the data stream output from the 
polyphase filter are quantized and coded. In Layer m, each of the 32 subbands 
output from the polyphase filter are passed through a window which parses the 
stream into long blocks of 18 samples or short blocks of 6 samples, with 50% 
overlap such that the window lengths are respectively 36 and 12 samples wide. 
Long blocks are used to achieve better frequency resolution for the relatively 
constant components of the audio signals while short blocks are used for improved 
frequency resolution of transients. The blocks for each subband are then processed 
with a Modified Discrete Cosine Transform (MDCT). The subbands are further 
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divided in frequency to improve spectral resolution such that some of the aliasing 
caused by the polyphase filter can be canceled. 

In MPEGx, Layer m, the quantization is non-uniform to make the signal 
to noise ration over the range of quantization values more consistent. Additionally, 
5 Layer EI utilizes scale factor bands of approximating critical band widths and cover 

several MDCT coefficients. The scale factors are used during noise allocation to 
vary the frequency-dependent masking threshold, and essentially set the gain for 
each subband. Moreover, Huffman encoding is performed on the quantized MDCT 
coefficients for improving data compression. Finally, a "bit reservoir" is employed, 

10 to which bits can be donated when less than the average number of bits are required 

to code a frame and from which bits can be borrowed when more than the average 
number of bits are required to code a frame. 

Frames are formed from a header, a CRC value, side information and main 
— dataralthough-therelativeposition-of these components of the frame are not 

15 necessarily always in the same sequence, or even adjacent in the stream. The 

header includes a set of frame sync bits, MPEG version and layer identifiers, a 
CRC protection bit, a bitrate index indicating the bitrate at which the frame was 
created, and a sampling rate frequency index indicating the frequency at which 
audio data was sampled, and along with additional information about the 

20 transported data. 

An MPEG-1, Layer HI bitstream can then be decoded generally as follows. 
Data is input to the decoder in a predetermined number of frames per second. The 
frame sync bits in the header portion of each frame is detected. Next, the scale 
factors are extracted and decoded. This is followed by decoding of the Huffman 

25 encoded main data representing the frequency energies. The scale factors are 

applied and the data requantized. At this point, if stereo data is being processed, 
the stereo channels are recovered and aliasing reduction performed. An inverse 
MDCT operation is performed to followed by an overlapping inverse Discrete 
Cosine Transform (DCT) to return the data to the time-domain. A low pass filter 
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is applied to recover the PCM samples, each of which is essentially a weighted 
average of the adjacent 512 time-domain samples. 

When integrated circuit 1 00 is configured as an MPEGx, Layer m decoder, 
a stereo DAC, such as a Chrus Logic CS43Lxx Stereo Audio DAC, is coupled to 
Digital Audio Port 128 for driving a set of headphones. An analog to digital 
converter, such as a Cirrus Logic CS53L32 Audio A/D Converter may also be 
coupled to this port for the input of data from a microphone. This embodiment of 
FIGURE ID includes an on-chip PWM circuit that can drive headphones at CD 
quality levels without an external stereo audio DAC. 

It is often necessary to prevent tampering, copying or logic analyzer 
examination of the software and firmware bundled with an electronic product. As 
a result, some level of security must be provided, for example through the use of 
encrypted passwords, which allow the manufacturer authorized end users access 
to the system memory assets for purposes of downloading, debugging, and 
upgrading the software or firmware, but denies that same level of access to 
unauthorized end users. In the context of a digital audio player, this will allow 
online music distributors the confidence to allow end users who have paid the 
royalty and received the requisite passwords to download songs, with the 
knowledge that unauthorized downloads will at least be deterred to some degree. 

Generally, there are several criteria a security scheme must meet. First, the 
system must not allow unauthorized access as a result of power-on reset. Second, 
secure information, such as encrypted passwords, security code, and the 
information concerning locations in memory where the secure information resides, 
must not be readily accessible outside the system. Notwithstanding, this secure 
information must be checked during production test procedures to guarantee 
acceptable end user system quality with regards to normal manufacturing defects. 
Finally, if security measures are not provided or not invoked, normal operation of 
the system should proceed in the expected fashion. 



25 



BNSDOCID: <WO. 0157676A1 _l_> 



WO 01/57676 



PCT/US01/02560 



Advantageously, the principles of the present invention provide security 
techniques which allow integrated circuit 100 to meet each of these criteria. In 
accordance with one such technique, the capability of processor 101, in response 
to either certain default conditions or the dynamic assertion of certain instructions, 
5 to reverse the Chip Select signal decoding discussed above is employed. By 

reversing the chip select decoding on power-on reset, the security code can be run 
from a normally unaccessible memory space. Moreover, this feature of processor 
101 can only be invoked in a specific period of time when processor 101 is not 
executing instructions, which further complicates any attempted security breach. 

10 FIGURE 12 is a flow chart illustrating a preferred procedure 1200 for 

booting integrated circuit 100 in accordance with the inventive concepts. It will be 
assumed that processor 101 is an ARM720T or ARM920T processor, and signal 
names will be in reference to the signals and/or instructions thereof. The procedure 
begins with the power on reset of integrated circuit 100 by the assertion of the 

15 power-on reset (NPOR) signal, at Step 1201. Circuits within the system 

immediately disable all hardware and debug features and hides all security 
elements (e.g. firmware, registers, passwords) from external probing (Step 1202). 
This step insures that the system is secure, at least up until Step 1203, where a 
check is made to determine if security firmware routines are in place and enabled. 

20 In the preferred embodiment, this is accomplished by reading the programmable 

fuse registers. 

For purposes of discussion, the case where security is not provided or is 
disabled will be considered first. At Step 1204, a determination is made as to 
whether boot is to continue from an internal ROM or if an external memory will 
25 be used. For the ARM processor embodiments, the NMEDCHG bit is used to 

select between internal and external boot memory options. If at Step 1204, the 
signal at pin NMEDCHG is clear (i.e. in an active low state), then boot of 
integrated circuit 100 will be from internal ROM. In this case, the address 
mapping to internal boot ROM is reversed by default at Step 1 205. After reversal 
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of the address mapping, execution is from current boot ROM location 0 (Step 
1207). In this illustrated embodiment, the power-reset signal NPOR must be 
asserted to return the address mapping to its normal state. 

Alternatively, if the NMEDCHG bit is set (i.e. in an active high state), then 
5 boot will be from external memory (ROM/EPROM/Flash). In this case, the chip 

select mapping is set as shown in TABLE 5 A with the external Chip Select 0 being 
selected as the boot memory. 

Next, consider the case where a read of the programmable fuse registers 
indicates that a security routine is in place and enabled. Boot branches at Step 
10 1208 to the execution of the security procedure. 

Integrated circuit 1 00 can be configured to respond to different sets of boot 
and/or security code. This advantageously allows integrated circuit 100 to operate 
using the boot/security firmware from multiple vendors, even though the secure 
information of each vendor may only be accessible by that vendor's own 
1 5 boot/security procedures. Initially, the boot memory is programmed with multiple 

boot code sets or options. This can done using the internal boot ROM or one or 
more chips of external memory (ROM/RAM/Flash). With multiple boot options, 
the end user will be able to select between security firmware available from 
different vendors. 

20 Consequently, at Step 1209 a first one of the booting options in boot 

memory is identified and at Step 12 10 aliased to the reset vector, typically location 
0x00 for the first option. All necessary security elements (registers, firmware, I/O 
devices) required for the given implementation are enabled by the current boot 
option while all other security options (implementations) are kept hidden (Step 

25 1211). The selected boot code is then run by the processor at Step 1212toattempt 

to initialize for the selected security firmware/software. 

If at Step 1213 the proper security firmware/software is found in memory 
as called by the boot code, then integrated circuit 100 completes boot and runs in 
the selected secured environment at Step 1214 under supervising control of the 



27 



BNSDOCID: <WO 01 57676A1_1_> 



WO 01/57676 



PCT/USO 1/02560 



security firmware/software. On the other hand, if the required security 
firmware/software is not found, another boot option must be tried. 

If the last security option has not been reached at Step 1215, then the next 
security option in boot code is selected (Step 1 2 1 6). An instruction is issued which 

5 dynamically forces the processor to the new reset vector. In this instance, the reset 

vector jumps to point to the second security option in boot code. At Step 1218, the 
processing returns to Step 1211 and the boot process is attempted again. It should 
be noted that in the illustrated embodiment, the instruction pipeline has three 
stages. Consequently, the instruction resetting the program counter to 0 has already 

0 been loaded from internal boot ROM before execution of the instructions that 

changes the chip selects. The MOV pc, #0 instruction causes the processor 
pipeline to be flushed thereby allowing several cycles to occur before the change 
of chip selection must occur. During this process, no other accesses are allowed 
to those memory resources whose chip select signals will change during the 

5 execution of the remap command. 

This process repeats itself until either a security option is found which 
causes integrated circuit 100 to enter secure operation at Step 1214 or the last 
security option is reached at Step 1215. In the illustrated embodiment, the last or 
default option returns the boot procedure at Step 1219 back to a normal (unsecure) 

0 boot. Here, all the debug features are enabled and the security features are hidden 

at Step 1220. At Step 1221, a default boot ROM is selected and at Step 1222 the 
processor is dynamically forced to the reset vector. Notwithstanding, in alternate 
embodiments, default security code may be provided in order that integrated circuit 
1 00 can still run in a secure environment even though all of the primary options are 

5 unavailable. 

In embodiments of integrated circuit 100 based upon an ARM 920T, 
instructions and data can be locked into the corresponding instruction and data 
caches, such that they are not chosen as victims for replacement by the replacement 
algorithm on a cache miss. Locked in data/instructions guarantee a cache hit with 
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the corresponding information being fetched directly from cache and the favorable 
cache access latency. Moreover, the locked encached information is unaccessible 
outside of integrated circuit 1 00, except through the JTAG port or other test-debug 
modes that allow visibility to the cache or TLB memories. The JTAG port, used 
5 primarily during product development and testing, can be disabled integrated 

circuit 100 leaves the manufacturing floor. 

Before locking cache entries, the corresponding descriptors (physical 
addresses and permissions) must be locked in the associated Translation Look 
aside Buffers (TLBs) to obtain predictable performance results. In addition to 

1 o cache, many devices, such as the ARM 920T used in the present examples, include 

both data and instruction translation look-aside buffers (TLBs). For a given 
instruction or field of data, the CPU generates a virtual address. A modified virtual 
address is then presented to the corresponding TLB and a comparison is performed 
between fields of the modified virtual address and the comparison (tag) registers 
15 in the TLB. If a match occurs, and the access is allowable, (as determined by the 

permission fields in the TLB entry), the physical address bits returned from the 
corresponding TLB entry is used, along with index bits from the modified virtual 
address, to generate a physical address, accessing cache or external memory, as 
required. If a miss occurs, the process discuss below is invoked to translate the 

2 0 virtual address into a physical address in hardware. 

When cache lines are locked, the corresponding entries in the data and 
instructions TLBs should also be locked and exempted from replacement during 
TLB update. In the case of the ARM 920T processor, TLB entries are locked by 
writing identifiers for the specific entries in the data and instruction TLBs being 
25 locked into TLB Lock Down field of the System Control Processor register C15. 

TLB Lockdown procedure 1300 of FIGURE 13 is one method of locking 
entries in either an instruction or data TLB. At Step 1301, a page table is setup 
including physical address bits and permissions corresponding to the protected data 
or instructions. At least some of the entries in the target TLB are then flushed or 
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cleaned to insure that the code to be locked is not already in the TLB registers (Step 
1302). 

In embodiments employing the ARM 920T processor, both the data and 
instruction TLBs are organized in a single segment of 64 lines. A replacement 
5 (victim) counter points to the entry being replaced. Therefore, at Step 1303, the 

replacement counter is updated to point to the next entry to which locked 
information is to be written. In the preferred embodiment, the process begins at 
entry 0. 

For the instruction TLB a Prefetch instruction is used to generate a 
10 modified virtual address force a TLB miss (Step 1304). In the case of the data 

TLB, a Load instruction can be used to force the miss. After the miss has occurred, 
a page table walk must be performed to generate the descriptor (e.g. physical 
address and permissions) to be loaded into the TLB (Step 1305). At Step 1306, the 
descriptor generated from the page table walk, using physical address bits from the 
15 accessed page table entry and index bits from the modified virtual address, is 

loaded into the given TLB at the entry pointed-to by the current replacement 
counter contents. 

In the ARM 920T embodiments, the loaded TLB entry is locked at Step 

1 307 by setting a bit in a corresponding TLB Lockdown register. If the last entry 
2 0 has been reached at Step 1308, the procedure ends, otherwise, at Step 1309, the 

procedure loops back to Step 1303 and the replacement counter updated in 

preparation to load the next entry. 

Once the TLB entries are locked, the corresponding data or code can be 

locked in cache. For purposes of discussion, consider the case where instructions 
2 5 are locked into the ARM 920T instruction cache. The case of the data cache is 

similar. It should also be noted, that the inventive concepts are not limited to 

systems employing ARM processors, and can be applied to any system or device 

including Lockable instruction and/or data cache. 
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FIGURE 1 4 illustrates a cache lockdown procedure 1 400 for locking secure 
code into cache. As will be discussed further, in order to perform the locking 
operation, a cache miss must be forced in the illustrated embodiment. A preferred - 
method of forcing a cache miss is discussed later in conjunction with FIGURE 15. 
5 At Step 1401 an actual or emulated page table is set up with the physical 

addresses to the locations in memory where the data or instructions to be locked in 
cache reside. An emulated and synthesized page tables embodying the inventive 
concepts are also discussed further below. This table is used to update the 
corresponding TLB, preferably using procedure 1300. 
1 Q At Step 1 402, the given cache is flushed or cleaned of at least some cache 

lines to insure that the code to be locked-in is not already encached. The 
replacement (victim) counter associated with the cache is forced to point to the first 
cache line (cache line 0) at Step 1403. In the preferred embodiment, each of the 
data and instruction caches is partitioned into 8 64-line segments each indexed by 
1 5 index fields in the modified virtual address. In procedure 1400, cache lines will be 

filled sequentially, with for example, all cache lines 0 of all segments filled in 
sequence first, followed by the sequential fill of all cache lines 1, and so on. 

The data or instructions to be encached are generated, possibly requiring a 
decryption process (de-encrypted), and stored at corresponding locations in an 
2 0 alternative memory, such as internal SRAM or external SRAM/DRAM/Flash at 

Step 1404. Then, at Step 1305, a Prefetch Cache Line operation is performed for 
an instruction encache to invoke a look-up at the pointed-to cache entry. (For the 
data cache, the LOAD instruction can be used). This causes a cache miss thereby 
requiring the processor to access the alternate memory containing the necessary 
2 5 data or instructions. It can do this by referring to the TLB for the necessary bits for 

the physical address, if the TLB is current and accurate, or by walking through the 
page tables set up at Step 1401 directly. The physical address itself is generated 
from base address bits in the entry accessed in the TLB and index bits from the 
virtual address. 
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At Step 1405, the generated code or data is placed where the cache miss is 
to be processed and a line fill is performed at Step 1406 to the cache line at the 
current replacement pointer entry. Again, the cache segment is indexed by cache 
segment index bits from the virtual address causing the cache miss. 

If, at Step 1407, the last segment in the given cache not been reached and 
further cache operations are required, the processor increments the cache segment 
index bits at Step 1408 to force the next cache access to the next cache segment 
at the current replacement counter value. The procedure returns to Step 1404 and 
continues from there. 

However, if the just completed operation was to the last cache segment, and more 
cache operations are to follow (i.e. the last cache line to be filled has not been 
reached at Step 1409), then at Step 1410, the procedure jumps back to Step 1403, 
the replacement counter value is updated, and the procedure continues from that 
point. 

When all the code to be locked-in has been loaded, the Replacement 
Counter base is set to a value one higher that the base to the locked cache lines 
(Step 1411). This insures that the private data (now decrypted) will not be 
overwritten on a cache miss or become accessible by an unauthorized party. The 
code can then be executed from cache at Step 1412. 

One means of creating locked, encached data without using memory 
locations for the entire region to be locked is to use a cache line's length of registers 
to emulate the region. In addition, cache miss emulation can also be used to 
remedy hardware limitations on the cache locking granularity. For example, in the 
ARM 920T embodiment, cache can be locked in 64 word blocks (256 bytes). 
Each cache line however is only 8 words (32 bytes) long, and therefore can be 
mapped a different locations within the 64 word block, depending on the address 
bits. 

According to the inventive concepts, for each lockable location, eight (8) 
programmable 32-bit emulated cache line (ECLINE) registers are set up in an 
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alternate location in memory as eight contiguous 32-bit locations. Additionally, 
a comparison (offset) register (ECOFFSET) is setup which is programmed with 
physical addresses to identify where in the cache memory space the ECLINE 
register contents will reside after the emulated cache miss. Consequently, the 
location a single cache line in size can be used to represent an entire 64 word 

Lockable location. 

An emulated cache miss procedure 1500 is then set forth in the flow chart 
ofFIGURE 15. At Step 1501, the contents to be encached (in either the instruction 
or data cache) are written into the ECLINE registers. An offset to the Lockable 
cache space to which the data are to be written is then programmed into the 
ECOFFSET comparison register (Step 1502). 

At Step 1503, an operation is performed to cause a cache miss For the 
instruction cache, this can be done through a Prefetch Instruction for the instruction 
cache, and for the data cache, through a Load. The virtual address generated to this 
location causes a miss to the given cache, and the corresponding physical address 
is then generated using index bits from the virtual address and base bits retrieved 
from the appropriate TLB or alternatively, through a page table walk. At Step 1 505, 
the information in the corresponding ECLINE registers is retrieved and at Step 
1 506, loaded into the cache at the addressed entry. This entry is now prepared to 
be locked using procedure 1400. Advantageously, the procedure has allowed the 
locked portion of cache to be loaded without resort to either internal or external 
SRAM. 

As already indicated, page table walks are required during cache and TLB 
locking operations in order to generate addresses to physical memory from where 
the data or instructions are to be retrieved. The present inventive concepts allow 
for the creation of streamlined page tables which save on the amount of memory 
which must be dedicated to page table support. Moreover, even in view of a TLB 
miss, the inventive concepts also protect data and instruction code against 
tampering, copying or electronic analysis through secure operation of MMU 104 
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during address translation by section/page table walks. Again, an ARM 920T 
processor core will be considered for illustrative purposes, although the inventive 
principles can be applied to the memory management schemes of other processors 
and memory management units. 

A conventional page table walk for this embodiment generally proceeds as 
follows. During the Level 1 fetch, a section descriptor (Level 1), course page table 
or fine page table base address is retrieved from a 4096-entry Translation Base 
Table (TBT). The TBT is accessed using a TBT base address from the Translation 
Base Register and a Table Index field from the modified virtual address. 

If the output from the TBT is a section descriptor, that descriptor includes 
a Section Base Address and access permissions. A physical address to a 1MByte 
section of memory is then generated using the Section Base Address bits from the 
Level 1 descriptor and Section Index bits from the modified virtual address. 
(Assuming that the permissions contained-in-the Level 1 Section Descriptor are 
favorable). 

A Course Page Table Base address from the TBT, along with Level 2 
Table Index bits from the modified virtual address, accesses one of 256 entries in 
the Course Page Table thereby dividing the 1 Mbyte blocks into 4 Kbyte blocks. 
The Course Page Table returns either a Large or Small Base Address along with 
access permissions. Depending on the state of the permissions, the Large or Small 
Page Base Address bits are combined with Page Index bits from the modified 
virtual address to produce a physical address either a 64 Kbyte large page or 4 
Kbyte small page from memory. 

A fine page base address from the TBT, along with Level 2 Table Index bits 
from the modified virtual memory address, points to a 1 024 entry Fine Page Table. 
The output from this table is a Level 2 Descriptor which includes either a large, 
small or tiny base address along with access permissions. Large pages are 64 
kBytes, small pages 4 KBytes and tiny pages 1 Kbyte. Assuming the permissions 
indicate that the access is allowed, the page base address is concatenated with Page 
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Index bits from the modified virtual address to generate a physical address to either 
the large or small pages in memory already discussed, or lkbyte tiny pages in 
memory. 

The memory accessed as a result of the page table walk can be either cache, 
internal memory or external memory. The physical addresses and permissions are 
used to update the TLB. Any secure information is then locked, as described 

above, into the TLB. 

The disadvantage of this two-level table walk procedure comes from the 
fact that the various tables require substantial amounts of on-chip memory. As 
previously discussed, secure information must reside inside the system in areas of 
the memory not accessible by an unauthorized user. Therefore, some provision 
must be made for efficiently storing sensitive information, such as the physical 
address translation scheme, within the available internal memory. 

In the preferred embodiment of integrated circuit 100, the table walking 
process can be significantly simplified and the amount of memory required for the 
translation tables greatly reduced. Not only is this important in terms of increased 
operating efficiency, but it also insures that resort to unsecure external memory is 
not required. 

Here, the memory space is divided up into 256 MByte regions, each of 
which is associated with a common set of access characteristics.(e.g. access 
permissions, cacheability, bufferability). Of only one of these regions, only 1 
MByte requires a second level page table. Thus, since large regions of memory 
have common access characteristics, much smaller translation tables can be created 
within the available SRAM space. 

The access permissions indicate whether given information can be accessed 
fromthecorrespondingmemoryblock. The cacheability and bufferability attribute 
bits are used to determine if an accessed piece of information can be stored in 
cache or transferred through the write buffer. For example, the contents of the real 
hardware registers controlling the UARTs and other peripherals and I/O devices are 



35 



WO 01/57676 



PCT/US01/02560 



generally not allowed to be cached or buffered by the CPU subsystem. This would 
cause incorrect behavior of these peripherals due to the timing of when the accesses 
would actually occur. 

Moreover, for a secure system the page/section table information must be 
5 kept within the confines of the private area such that this information can not pass 

from memory to the device pins that can be examined by a logic analyzer. 

In the illustrated embodiment, where the memory is divided into 16, 256 
Mbyte blocks, a 32-bit register is created for storing the Level 1 AP bits, each two 
bit pairs mapping to a 256 MByte memory region. For example bits [1 :0] map to 
10 Region 1, bits [3:2] to Region 2, and so on. A 16-bit register is setup for holding 

a set of bits indicating the cacheability of each region for Level 1 . Another 16-bit 
register is setup for holding either a set of bits indicating the cacheability of each 
region. These registers are pointed to by the contents of the Translation Base 
Register in the MMU. 

15 A procedure 1600 for updating these registers, as well as for handling 

memory regions which have unique characteristics and constants, is shown in 
FIGURE 16 A. 

For a given 256 Mbyte region, a determination is made at Step 1601 as to 
whether it has a common set of access characteristics. If the determination is 
2 0 affirmative, then at Step 1602, the corresponding entry in the global Level 1 AP 

register is loaded with the appropriate AP bits. The corresponding entries in the 
global Level 1 bufferability and cacheability registers are similarly updated at Steps 
1602 and 1603. 

At Step 1 605, the procedure returns for the update of the register entries for 
25 the next memory region (block) requiring update. Initialization/update of the 

global access control registers is preferably done in a loop. The values in general 
do not change but can be updated if necessary during system processing. The full 
register values for entries that are not to be synthesized are updated as appropriate 
during system operation. For example they will need updated when a page of 
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memory is substituted for another when it is "swapped" out to disk or similar mass 
storage devices. 

For those memory blocks or registers which have a unique set of access 
characteristics at Step 1601, including access permissions, bufferability and 
cacheability bits, and physical address bits, a full 32-bit register is loaded with a 
complete Level 1 descriptor at Steps 1606 and 1607. The procedure again loops 
back at Step 1 608. This descriptor can include a course or fine page (Level 2) table 
address. 

Otherwise, a constant is pointed to in hardwired gates at Step 1608. The stored 
constant can be a fixed value or a base address to a Level 2 table. If a walk to 
Level 2 is not required at Step 1609 then the procedure loops back at Step 1610. 
Otherwise, at Step 161 1, a corresponding register in the Level 2 synthesized table 

is set up at Step 1611. 

A similar process is used to synthesize Level 2. Specifically, for each Level 
2 page, a register pointed-to by the Level 2 base address bits from the Level 1 
registers A global Level 2 AP register, along with Level 2 bufferability and 
cacheability registers are set up as before, for pages and sub-blocks having 
common characteristics. 

For a given page or set of pages, a determination is made at Step 1612 as 
to whether it has a common set of access characteristics. If the determination is 
affirmative, then a Step 1613, the corresponding entry in the global Level 2 AP 
register is loaded with the appropriate AP bits. The corresponding entries in the 
global Level 2 bufferability and cacheability registers are similarly updated at Steps 
1614 and 1615. 

At Step 1616, the procedure returns to Step 1601 for the update of the 
register entries for the next memory region (block) requiring update. 

For those Level 2 pages, sets of pages, blocks or registers which have a 
unique set of Level 2 access characteristics at Step 1612, including access 
permissions, bufferability and cacheability bits, and physical address bits, a full 
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32-bit register is loaded with a complete Level 2 descriptor at Step 1618. 
Otherwise, a constant is pointed to in hardwired gates at Step 1618. The stored 
constant can be a fixed value, base address, or the like. The procedure again loops 
back at Step 1619 to Step 1601. 
5 An exemplary synthesized page table walk is illustrated in FIGURE 12B. 

At Step 1620, the table walk is requested. This request could be in response to a 
TLB and/or cache miss. In this example, consider first the case where a second 
level of Table walk is not required at Step 1 621 . The Level 1 registers discussed 
above are then pointed-to by the translation base register in the MMU at Step 1 622. 

1 0 The Level 1 register entries are indexed using the table index bits from the virtual 

address (Step 1623). 

At Steps 1624 and 1625, a determination is made as to whether the return 
from the indexed entry in the Level 1 registers is either a full descriptor or a 
constant. The case in which the return is neither a constant nor a full descriptor 

1 5 will be considered first. 

At Step 1626 the access control bits in the first level global access registers 
(i.e. the AP, cacheability and bufferability bits) are retrieved. The table index from 
the virtual address are then transformed into physical address bits at Step 1627 by 
moving bit positions relative to the virtual address. 

20 In the preferred embodiment, the transformed virtual address bits for 

section entries will be the table index bits (bits 1 3:2 of the lookup word index into 
the 4096 entry level 1 page table) become bits (3 1 :20) of the result for the entry (1 
Mbyte memory region). The domain of the section will be defined by bits (13:10) 
of the memory location. For embodiments using the ARM920 or ARM720 MMU), 

25 several bits in the page table entries are always a constant 0 or 1 . 

The Level 1 descriptor is formed at Step 1 628 by merging the transformed 
address bits and the retrieved access control bits. The synthesized descriptor is 
returned at Step 1629 for updating the TLB and or cache. 
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Returning to Steps 1624 and 1625, the Level 1 entry can also be a full 
descriptor (Step 1630), or a constant (Step 1631). The descriptor or constant can 
be used immediately at Step 1632. 

Assume next that a Level 2 table reference is required at Step 1621. 

The Level 2 translation is similar to that performed when only a Level 1 
reference is required. At Step 1633, the Level 2 registers, setup as described 
above, are pointed to by a base address in the MMU. The specific register or entry 
is indexed using the table index bits from the virtual address at Step 1634. A 
determination is made at Steps 1635 and 1636 as to whether the indexed register 
(entry) contains a full descriptor or a constant. If a descriptor is found, then that 
descriptor is retrieved at Step 1637 and if a constant is found, that constant is 
retrieved at Step 1 638. The descriptor or constant can then be immediately used 
at Step 1639. 

If at Steps 1635 and 1636, neither a constant nor a descriptor is found, then 
at Step 1640, the second level access control registers are accessed and the 
corresponding access control bits retrieved at Step 1641 using the page index bits 
from virtual address. At Step 1 642, the page index bits from the virtual address are 
transformed into physical address bits by shifting bit positions. These physical 
address bits, along with the retrieved access control bits are merged at Step 1643 
to form a synthesized descriptor. The synthesized descriptor is returned at Step 
1644 for updating the TLB, performing a memory upon a cache miss, or similar 
operation. 

It should be noted that for brevity that the synthesized table walk has been 
described in terms of only Level 1 and Level 2 descriptor generation. It should be 
noted however, that additional levels of walk through below the second level can 
be implemented by repeated application of the inventive principles. 

In sum, according to the inventive concepts, only a 32-bit AP register and 
a pair of 16-bit registers for bufferability and cacheability are needed as first level 
tables. A second level table composed of a small page AP register and one-bit 
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cacheability and bufferability registers is all that is required for each second level 
page that must be addressed. 

The inventive concepts also advantageously allow for address translation 
and TLB update upon a cache miss through register emulation of memory similar 
5 to the cache miss emulation. Subsequently, the cache and/or TLB entries can be 

locked as described above for security. The preferred emulation process employs 
an alternate, emulated memory, such that the integrated circuit 100 internal 
memory can be spared for other tasks. The memory addresses of the page tables 
are preferably mapped inside the integrated circuit. A preferred procedure 

1 0 embodying these concepts is the Emulated Table Walk / TLB Update procedure 

1 700 illustrated in FIGURE 1 7. 

First, an emulated Level 1 Translation Register (table) (ELI TR) containing 
either Level 1 Descriptors or Level 2 base addresses is created at (Step 1701). 
Additionally, an emulated Level 1 Index Register (EL1IR), maintaining indices to 

15 the entries in the EL1TR, is setup in the alternate memory space (Step 1 702). The 

Translation Base Table (TTB) in the MMU is programmed to point to the emulated 
Level 1 table. Requests to this region will receive the contents of EL1TR with the 
index into the table matches ELI IR. If the index does not match the value returned 
will be an entry that causes an exception to occur. 

20 For those address translations continuing past the Section level, an 

emulated Level 2 Translation Register (EL2TR) containing Level 2 Descriptors is 
created in alternate memory (Step 1704), along with an emulated Level 2 Index 
Register holding the corresponding indices (Step 1 705). 

At Step 1706, virtual address is generated by prompting CPU 101 or 

2 5 through the use of an external address generator. If the cache and TLB have been 

flushed or cleaned, a cache/TLB miss will occur, and therefore, at Step 1707, the 
table walk procedure is invoked using the emulated level 1 table pointed-to by the 
MMU. Level 1 Table Index bits in the virtual address are compared with those in 
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EL1IR and the corresponding Level 1 information returned from EL1TR (Step 
1708). 

If the information is a descriptor (i.e. no Level 2 translation is required) at 
Step 1 708,then a Level 1 access is performed (Step 1 709) wherein the permissions 
in the descriptor are examined (Step 1710). If permission is not granted, then the 
operation aborts at Step 171 1. Otherwise, the physical address is generated from 
the Section address bits in the Level 1 descriptor, along with the Section Index 
from the virtual address, at Step 1712. The physical address can then be loaded 
into the TLB at Step 1713 to await locking and the corresponding data or 
instruction loaded into the appropriate cache. If it is determined at Step 1714 that 
the current entry in the TLB is not the last to be loaded, then at Step 1715 the 
procedure loops back to Step 1706 to initiate the next table walk. Otherwise, the 
TLB locking procedure is executed at Step 1716. 

If the information from EL1TR is found to be a base address to Level 2 at 
Step 1708, the Level 2 page walk is invoked at Step 1717. The EL2TR registers 
are accessed (Step 1718) using the base address from ELI TR. The specific register 
is indexed using the contents of corresponding EL2IR register by comparison 
against the index bits from the virtual address (Step 1719). The permissions in the 
returned Level 2 descriptor are examined at Step 1 720. If the access is not allowed, 
the access is aborted at Step 1721, otherwise the physical address is generated at 
Step 1 723 using the physical address bits from the Level 2 descriptor and index bits 
from the virtual address. The physical address is loaded into the TLB at Step 1 723 
to await locking. 

If the current TLB entry is the last to be loaded at Step 1 724, the TLB lock 
procedure can be invoked at Step 1725, otherwise, at Step 1726 the procedure 
jumps back to Step 1706 and the table walk for the next TLB entry is initiated. 

In some embodiments of integrated circuit 100, a bare CPU may be 
employed which does not include a Memory Management Unit (MMU) or 
hardware cache. For example, CPU core 1 01 could be based upon an ARM7tdmi 
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processor 102 alone, without cache 103 or MMU 104. When this option is 
selected, all software must be stored in memory in a flat memory space. However, 
this may require the use of external memory (e.g. NOR Flash, SRAM, DRAM). 
As discussed above, the data in an external memory has the significant 
5 disadvantage of potentially being accessed or analyzed by an unauthorized end 

user. 

In the embodiment of integrated circuit 100, which does not employ either 
a hardware cache or an MMU, the security code runs in an supervisor mode. In the 
supervisor mode, access to specific areas of memory and certain registers are 

1 o subject to a check against supervisor privilege. The security firmware preferably 

runs from internal memory, such as SRAM. In the supervisory mode, all other 
software/firmware is interpreted as running in the user mode and is therefore 
subject to supervisor privilege checking by the secured software. 

Although the invention has been described with reference to a specific 
1 5 embodiments, these descriptions are not meant to be construed in a limiting sense. 

Various modifications of the disclosed embodiments, as well as alternative 
embodiments of the invention will become apparent to persons skilled in the art 
upon reference to the description of the invention. It should be appreciated by 
those skilled in the art that the conception and the specific embodiment disclosed 

2 0 may be readily utilized as a basis for modifying or designing other structures for 

carrying out the same purposes of the present invention. It should also be realized 
by those skilled in the art that such equivalent constructions do not depart from the 
spirit and scope of the invention as set forth in the appended claims. 

It is therefore, contemplated that the claims will cover any such 
2 5 modifications or embodiments that fall within the true scope of the invention. 
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Port E data 


Svstem control 


Size 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


m 


CO 


CO 




RW 


RW 


1 


RW 


RW 


RW 


• 


RW 


RW 


RW 


RW 


default 
























» 


o 


o 




o 


o 


o 




o 


o 


O 


o 


Name 


PADR 


PBDR 


1 


PDDR 


PADDR 


PBDDR 




PDDDR 


PEDR 


PEDDR 


SYSCON 


Address 


0x8000.0000 


0x8000.0001 


0x8000.0002 


0x8000.0003 


0x8000.0040 


0x8000.0041 


0x8000.0042 


0x8000.0043 


0x8000.0080 


0x8000.00C0 


0x8000.0100 



in 
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Comments 


System status flags 
register 1 


Expiansion and 
ROM memory 
configuration 
register 1 


Expansion and 


DRAM refresh 
period register 


Interrupt status 
register 1 


Interrupt mask 
register 1 


LCD control 


Size 


CM 
CO 


CM 

m 


m 


oo 


CM 
CO 


CM 
CO 


CO 




§ 


RW ! 


RW 


RW 


RD 


RW 


RW 


Default 


O 


o 


o 


o 


o 


o 




Name 


SYSFLG 


MEMCF 
Gl 


MEMCF 


DRFPR 


INSTRl 


INTMRl 


LCDCO 


Address 


0x8000.0140 


0x8000.0180 


0x8000.01 CO 


0x8000.0200 


0x8000.0240 


0x8000.0280 


0x8000.02C0 



LD 
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Comments 


o 

CO 

ts 

*c 

CO 
(L> 


Read/Write data to 


Realtime clock data 
register 


Realtime clock 
match register 


PWM pump control 
register 


CODEC data I/O 
register 


UARTl FIFO data 
register 


UART 1 bit rate 


Size 


VO 


VO 


m 


CM 




oo 


8W/11R 




I- 


RW 


RW 


RW 


RW 


RW 


RW 


RW 


RW 


Default 


o 


o 


i 


i 


o 


o 


o 


O 


Name 


TCID 


TC2D 


RTCDR 


RTCMR 


O 

O 


CODR 


UARTD 
Rl 


UBLCRl 


Address 


0x8000.0300 


0x8000.0340 


0x8000.0380 


0x8000.03C0 


0x8000.0400 


0x8000.0440 


0x8000.0480 


0x8000.04C0 
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Comments 


Synchronous serial 
I/O data register for 
master only SSI 


Least significant 
32-bit word of 
LCD palette 
register 


Most significant 
32-bit word of 
LCD palette 
register 


Write to clear all 
start up reason 
flags 


Size 


CM 
CO 


CO 


CO 






RW 


RW 


RW 


WR 


Default 


o 


© 


o 


i 


Name 


SYNCIO 


CO 

i * 


00 

S 


STFCLR 


Address 


0x8000.0500 


0x8000.0540 


0x8000.0580 


0x8000.05C0 
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Comments 


Write to clear 
battery low 
interrupt 


Write to clear 
media changed 
interrupt 


Write to clear tick 


Write to clear TCI 
interrupt 


Write to clear TC2 
interrupt 


Write to clear RTC 
match interrupt 


Size 


i 


i 




i 








WR 


WR 


WR 


WR 


WR 


WR 


Default 






i 






• 


Name 


BLEOI 


MCEOI 


TEOI 


TCIEOI 


TC2EOI 


RTCEOI 


Address 


0x8000.0600 


0x8000.0640 


0x8000.0680 


Ox8000.06CO 


0x8000.0700 


0x8000.0740 



in 
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Comments 


Write to clear 
UART modem 
status changed 
interrupt 


Write to clear 
CODEC sound 
interrupt 


Write to enter the 
Idle State 


Write to enter the 


Write will have no 
effect, read is 
undefined 


LCD frame buffer 
start address 


Size 








i 








WR 


WR 


WR 






RW 


Default 




i 


i 


i 




o 


Name 


UMSEOI 


COEOI 


HALT 


e 

CO 


Reserved 


FBADDR 


Address 


0x8000.0780 


0x8000.07C0 


0x8000.0800 


0x8000.0840 


0x8000.0880 
0x8000.OFFF 


0x8000.1000 



in 

48 



BNSOOCID: <WO 0157676A1_I_> 



WO 01/57676 



PCT7US01/02560 



Comments 


System control 
register 2 


System status 
register 2 


Interrupt status 
register 2 


Interrupt mask 
register 2 


Write will have no 
effect, read is 
undefined 


UART2 Data 
Register 


Size 


so 


* 




so 




8W/11R 


I- 


RW 


RD 


RD 


RW 




RW 


Default 


o 


o 


o 


o 




o 


Name 


SYSCON 


SYSFLG 


DMSTR2 


INTMR2 


Reserved 


UARTD 
R2 


Address 


0x8000.1100 


0x8000.1140 


0x8000.1240 


0x8000.1280 


0x8000. 12C0 - 
0x80000. 147F 


0x8000.1480 
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Comments 


UART2 bit rate and 
line control register 


Master/slave SSI2 
data Register 


Write to clear RX 
FIFO overflow flag 


Write to pop SSI2 
residual byte into 
RXFDFO 


Write to clear 
keyboard interrupt 


Size 






■ 






s- 


RW 


RW 


WR 


WR 


WR 


Default 


o 


o 






i 


Name 


UBLCR2 


SS2DR 


SRXEOF 


SS2POP 


KBDEOI 


Address 


0x8000. 14C0 


0x8000.1500 


0x8000.1600 


0x8000. 16C0 


0x8000.1700 



in 
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Comments 


Do not write to this 
location. A write 
will cause the 
processor to go into 
an unsupported 
power savings 
state. 


Write will have no 
effect, read is 
undefined 


Size 








WR 




Default 


i 


i 


Name 


Reserved 


Reserved 


Address 


0x8000.1800 


0x8000.1840- 
0x8000. IFFF 
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Comment 


External fast interrupt input 
(NEXTFIQpin) 


Battery low interrupt 


Watchdog expired interrupt 


Media changed interrupt 


Codec sound interrupt 


External interrupt input 1 
(NEINTI pin) 


External interrupt input 2 
(NEINT2 pin) 


Name 


EXTFIQ 


BLINT 


WEINT 


MCINT 


CSINT 


EINTl 


EINT2 


Bit in 

INTMRl 

and 

INTSRl 


o 






m 








Interrupt 


FIQ 


FIQ 


FIQ 


FIQ 


IRQ 


IRQ 


IRQ 



ID 
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Comment 


External interrupt input 3 (EINT3 
pin) 


TCI underflow interrupt 


TC2 undreflow interupt 


RTC compare match interrupt 


64 Hz tick interrupt 


Internal UART 1 transmit FIFO 
empty interrupt 


Internal UARTl receive FIFO full 
interrupt 


Internal UARTl modem status 
changed interrupt 


Name 


EINT3 


TCIOI 


TC20I 


RTCMI 


TINT i 


UTXINTl 


URXINTl 


UMSINT 


Bit in 

INTMRl 

and 

INTSRl 




CO 


OS 


o 










Interrupt 


IRQ 


IRQ 


IRQ 


IRQ 


IRQ 


IRQ 


IRQ 


IRQ 
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to 
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CO 



§ 3 
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Z H ■© H 

S 5 § 2 



S 
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a 
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E 
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E 



03 

3 
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S 



3 
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cy 

S 



1 2 

2 co 

H T3 H 

z s z 



0> 

.s 

CO 
CO 

e 

o. 



3 



e5 



-a 
> 

S 

CO 



CO 
CO 

> 
c3 



CO 
CO 



<L> 



s 

CO 

s 



CO 



CO 
CO 

o 
> 

CO 



3 



X 

GO 
CO 
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s 

£ 

o 
U 



s 



a 

+-> 

5 



a. 
s 

.£ 
a 



2 2 

H T3 H 
Z = Z 



>> 

Oh 

E 

o 
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CO 
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J3 

I 



X 

S 



<L> 

1 

o 
b 

<L> 
> 

"53 
o 

2i 
H 



g 
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s 

9 



S 

E 

S 
o 

U 



a> 
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Z 



2 
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Oh 
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s 
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E 
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Standby State 
Latency 


Including PLL/ocs. 
setting time, approx. 
0.25 s when 
FASTWAKE= 0, or 
approx. 500 fxs when 
FASTWAKE=l,or = 
Idle State if in 13 MHz 
mode with CLENSL 
set 


As above 


Idle State 
Latency 


Worst case 
20 ^s; if 
only single 
cycle 

instructions, 
less that, 1 


As above 


Operating 

State 

Latency 


Worst case 
latency of 20 


Worst case 
latency of 20 


Input State 


Not 

deglitched; 
must be 
active for 
20 ,us to be 
detected 


not 

deglitched 


Interrupt 
Pin 


NEXTFIQ 


NEINTl-2 



IT) 
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Standby State 
Latency 


As above 


As above (note 
difference if in 13 MHz 
mode with CLKENSL 
set) 


Idle State 
Latency 


As above 


Worst case 

80 ms; if 
only single 
cycle 

instructions 

61 MS : 


Operating 

State 

Latency 


Worst case 
latency of 20 

MS 


Worst case 
latenncy of 
80 ms 


Input State 


Not 

deglitched 


Deglitched 
by 16 kHz 
clock; must 
be active for 
at least 80 
MS to be 
detected 


Interrupt 
Pin 


EINT3 


MEDCHG 
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10 



15 



20 



TABLE 4 



PE[1J 



0 
0 



PE[[0] 



Boot Block 
(NCSO) 



32-bit 



8-bit 



16-bit 



Undefined 



TABLE 5A 


Address Range 


Chip Select 


0000.0000-OFFF.FFFF 


CS7 (Internal only) 


1 000.0000-1 FFF.FFFF 


CS6 (Internal only) 


2000.0000-2FFF.FFFF 


NCS5 


3000.0000-3FFF.FFFF 


NCS4 


4000.0000-4FFF.FFFF 


NCS3 


5000.0000-5FFF.FFFF 


NCS2 


6000.0000-6FFF.FFFF 


NCS1 


7000.0000-7FFF.FFFF 


NCSO 


TABLE 5B 


Address Range 


Chip Select 


0000.0000-OFFF.FFFF 


NCSO 


1 000.0000- 1 FFF.FFFF 


NCS1 


2000.0000-2FFF.FFFF 


NCS2 ! 


3O00.0O00-3FFF.FFFF 


NCS3 


4000.0000-4FFF.FFFF 


NCS4 


5000.0000-5FFF.FFFF 


NCS5 
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6000.0000-6FFF.FFFF 


CS 6 (Internal only) 


7000.0000-7FFF.F1W 


CS7 (Internal only) 
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Pin Name 


A[27]/DRA[0] 


A[26]/DRA[1] 


A[25]/DRA[2] 


A[24]/DRA[3] 


A[23]/DRA[4] 


A[22]/DRA[5] 


A[21]/DRA[6] 


A[20]/DRA[7] 


A[19]/DRA[8] 


ON 

e 

oo 
< 


DRAM 
Row 
x32 
Mode 


OIV 


< 


A12 


A13 


A14 


A15 


A16 


A17 


oo 

< 


A20 


DRAM 
Row 
xl6§ 
Mode 


6V 


AlO 


< 


A12 


A13 


A14 


A15 


A16 


A17 


A19 


DRAM 
Column 
x32 
Mode 


A2 


< 


A4 


< 


9V 


A7 


oo 

< 


A9 


A19 


A21 


DRAM 
Column 
xl6 
Mode 


* 

< 




< 


A4 


< 


A6 


A7 


A8 


A18 


A20 


DRAM 

Address 

Pins 


o 
















OO 


• Os 



in 
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Pin Name 


o 

1 

< 


A[16]/DRA[11] 


A[15]/DRA[12] 


DRAM 
Row 
x32 
Mode 


A22 


A24 


A26 


DRAM 
Row 
xl6§ 
Mode 


A21 


A23 


A25 


DRAM 
Column 
x32 
Mode 


A23 


v-> 
< 


A27 


DRAM 
Column 
xl6 
Mode 


A22 


A24 


A26 


DRAM 

Address 

Pins 


o 
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I—) 

9 



4> 

53 



E 

OX 

CO 



& 

a 

& 

01) 



OA 

a 

S 

C« 

<U 

■a 
■a 

-< 



o 
H 



CO 



C#5 

3 



CO 



u 

OX) 

IS 

s 

o 

V 



(X, 
Uh 
(X, 
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o 
c 

I 

o 
o 
o 
o 

o 
o 



On 
On 



£ 
U 



CO 



CO 

•5; 



2 (3 



s 
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Size of 
Segments 


512 

KBYTES 
512 

KBYTES 
512 

KBYTES 
512 

KBYTES 
512 

KBYTES 
512 

KBYTES 
512 

KBYTES 


Address Range of Segments 


nOOO.0000-n007.FFFF 
n010.0000-n017.FFFF 
n040.0000-n047.FFFF 
n050.0000-n057.FFFF 
nl00.O000-nl07.FFFF 
nllO.0000-nll7.FFFF 
nl40.0000-nl47-FFFF 
nl50.0000-nl57.FFFF 


Total Size 
of 

Bank 


4 Mbytes 


Address 
Configuration 


12 Row x 8 
Column 


CO 


16 Mbit 
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a 
a 

CO Ctt 



CO 



CO 


CO 


to 


CO 


CO 


CO 


CO 


CO 










<U 




0> 






Mbyt 


Mbyt 


Mbyt 


Mbyt 


Mbyt 


Mbyt 


Mbyt 




cm 


CM 


CN 


CM 




CN 


CM 



CO 



SO 



CO 

a 
E 
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CO 
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VO 
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Size of Segment(s) 


1 MByte 
1 MByte 
1 MByte 
1 MByte 
1 MByte 
1MByte 
1 MByte 
1 MByte 


32 Mbytes 


128 Mbytes 


Address Range of 
Segments(s) 


nOO0.O000-n00F.FFFF 

n020.0000-n02F.FFFF 

n080.0000-n08F.FFFF 

nOAO.OOOO-nOAF.FFFF 

n200.0000-n20F.FFFF 

n220.0000-n22F.FFFF 

n280.0000-n28F.FFFF 

n2A0.0000-n2AF.FFFF 


nOOO 0000-nlFF.FFFF 


n000.0000-n7FF.FFFF 


Total Size of Bank 


8 Mbytes 


32 Mbytes 


128 Mbytes 


Address Configuration 


13 Row x 9 Column 


12 Row x 12 Column 


13 Row x 13 Column 


Size 


64 Mbit 


256 Mbit 


1Gbit 
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NRESET 
RESET 


SELFREF 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


O 


Reset 


Reset 


Reset 


NPOR 
RESET 


Off 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


Reset 


c 
O 


Reset 


Reset 


Reset 


Standby 


SELFREF 


to 
O 


Reset 


Off 

... 


to 
O 


Off 


Off 


fa 
o 


to 
O 


a 
O 


8 


to 
o 


Off 


Idle 


8 


c 
O 


e 
O 


C 

O 


c 
O 


c 
O 


c 
O 


a 
O 


8 


a 
O 


8 


a 
O 


Off 


M 

a 




























Operat 


8 


c 
O 


c 
O 


c 
O 


c 
O 


c 
O 


c 
O 


c 
O 


c 
O 


a 
O 


c 
O 


a 


c 

O 


Address (W/B) 


DRAM Control 


UARTs 


LCD FIFO 


LCD 


ADC Interface 


SS12 Interface 


DAI Interface 


Codec 


Timers 


RTC 


LED Flasher 


DC-to-DC 


CPU 



in 
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Reset 


Off 


Reset 


Off 


c 
O 


la 
o 


c 
O 


c 
O 


c 
O 


c 
O 


Interrupt 
Control 


PLUCLKEN 
Signal 
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TABLE 11 



Type 


Comments 


Referr 
ed To 

As 1 


Max, Transfer 
Speed 


SP/Microwire 1 


Master mode only 


ADC 

Interfa 

ce 


128 Kbps 


SPI/Microwire 2 


Master/slave mode 


SSI2 

Tnterfa 

ce 


512 Kbps 


MCP Interface 


CD quality DACs and 
ADCs 


DAI 


1.536 Mbps 


Codec Interface 


Only for use in the 
PLL clock mode 


Codec 
Interfa 
ce 


64 Kpbs 



TABLE 12 



SYSCON1 


SYSCON1 


13.0 MHz 


18. 432-73.728 


Bit 17 


Bit 16 


Operation 


MHz Operation 






ADCCLK Frequency 


ADCCLK Frequency 






(kHz) 


(kHz) 


0 


0 


4.2 


4 


0 


1 


16.9 


16 


1 


0 


67.7 


64 


1 


1 


135.4 


128 
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TABLE 17 



Bit 


Definition 


0 


Good Validation 


1 


ID all O's 


2 


CHK all 0's 


3 


ID all l's 


4 


CHK all l's 



TABLE 18 



10 



Name 


Address 


ID-CHK pair 


UNIQVAL 
UNIQVAL2 
UN1QVAL3 
UNIQVAL4 


0x2460 
0x2720 
0x2724 
0x2728 


UNIQID-UNIQCHK 
UNIQED2-UNIQCHK2 
UNIQED3 -UNIQCHK3 
UNIQID4-UNIQCHK4 


TABLE 19 


Name 


Address 


ID-CHK pair 


SECVAL1 
SECVAL2 


0x2540 
0x2544 


SECID1-SECCHK1 
SECID2-SECCHK2 
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WHAT IS CLAIMED: 

1 . A method of synthesizing translation tables comprising the steps of: 
setting up at least one register for storing information controlling access to 

a plurality of memory spaces; 

generating a virtual address including a pointer to selected information in 
the at least one register controlling access to a selected one of the memory spaces; 

accessing said selected information at said pointer from the at least one 
register; and 

generating a physical address to the selected one of the memory spaces 
from the information accessed from the at least one register. 

2. The method of Claim 1 wherein the selected information comprises access 
permissions. 

3. The method of Claim 1 wherein the selected information comprises 
cacheability and bufferability bits. 

4. The method Claim 1 wherein the at least one register comprises a first 
register for storing access permissions associated with each of the memory spaces, 
a second register for storing a cacheability bit associated with each of the memory 
spaces and a third register for storing a bufferability bit associated with each of the 
memory spaces. 

5 . The method of Claim 1 wherein the selected information accessed from the 
at least one register comprises a base address to at least one second level register 
controlling access to a selected part of a selected one of the memory spaces and 
said step of generating a physical address comprises the substeps of: 

accessing selected information in the at least one second level register using^ 
the base address and a table index from the virtual address; and 

generating the physical address from the selected information accessed 
from the at least one second level register and page index bits from the virtual 
address. 
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6. The method of Claim 1 wherein said information includes for each of the 
memory spaces a pair of access permission bits, a bufferability bit and a 
cacheability bit. 

7. A method of performing an emulated translation table walk comprising the 
steps of: 

emulating a translation register including a plurality of entries populated 
with descriptors; 

emulating an index register storing indices associated with the entries of the 
emulated translation register; 

pointing to the emulated translation register with a translation base pointer; 

generating an address including index bits to the emulated translation 
register; 

comparing the index bits from the address with the indices stored in the 

index register; and 

selectively accessing a corresponding descriptor in the translation table in 

response to said step of comparing. 

8. The method of Claim 7 wherein said step of generating an address 
comprises the step of generating a virtual address forcing a miss to an associated 
cache. 

9. The method of Claim 7 wherein the descriptors comprise selected physical 
address bits and access permissions and said method further comprises the steps 
of: 

determining from the permissions from the descriptor selectively accessed 
from the emulated translation table whether a corresponding access to memory is 
allowed; and 

if the access is allowed, generating a physical address to a location in 
memory using the physical address bits from the accessed descriptor. 

1 0. The method of Claim 7 and further comprising the steps of: 
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populating selected entries of the translation register with second level base 
addresses; 

emulating a second level translation register including a plurality of entries 
populated with second level descriptors; 
5 emulating a second level index register populated with indices to 

corresponding entries in the second level translation register; 

accessing the second level translation register with a base address from the 
translation register; 

comparing index bits from the virtual address with indices in the second 
10 level index register; and 

in response to said step of comparing, selectively accessing a second level 
descriptor from the corresponding entry in the second level translation table. 

1 1 . The method of Claim 10 wherein the second level descriptors comprise 
selected second level physical address bits and second level access permissions and 

15 said method further comprises the steps of: 

determining from the second level permissions of the descriptor selectively 
accessed from the emulated second level translation table whether a corresponding 
access to memory is allowed; and 

if the access is allowed, generating a physical address to a location in 
20 memory using the second level physical address bits from the accessed second 

level descriptor. 

12. A system comprising: 

a memory partitioned into a plurality of memory spaces accessible by a set 
of physical addresses; 
25 a register; and 

control circuitry operable to: 

store an access permission to a corresponding one of said memory 
spaces in said register; 

80 



BNSDOCID: <WO 01S7676A1 J_> 



WO 01/57676 



PCT/US01/02560 



access the permission from said register in response to a virtual 

address; and 

generate a physical address to the corresponding one of the memory 
spaces from the accessed permission. 

13. The system of Claim 12 wherein said control circuitry comprises a 
microprocessor. 

14. The system of Claim 12 and further comprising a second register accessible 
by said control circuitry for storing cacheability bits associated with each of said 
memory spaces. 

15. The system of Claim 14 and further comprising a third register accessible 
by said control circuitry for storing bufferability bits associated with each of said 
memory spaces. 
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